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I. Harini Shandilya, DO HEREBY DECLARE AND SAY: 

THAT, I am presently employed by Life Technologies, Inc. (hereinafter "Life 
Technologies," and now Invitrogen Corporation)', the assignee of the above-captioned 
appHcation as a Scientist. I have an M.S. degree in chemistry. I have been employed by Life 
Technologies since June 1, 1988. 

THAT,oneofmysupervisofsatLife Technologies has beenDebK. Chatterjee. During 
the time period the experiments described in this Declaration took place, I worked in close 
physical proximity to Deb Chatterjee. During this time period, I had at least a generalized 
knowledge of Deb Chatterjee's research projects. 

THAT, at times Deb Chatterjee would ask me to witness his laboratory notebooks. When 
he asked me to witness laboratory notebook pages, I would carefully read and understand the 
content of those pages, then sign and date the pages at the lower lefthand comer. Deb 

•.K T Technologies. Inc. merged with Invitrogen Corporation on September 12 2000 

with Invitrogen Corporation being the surviving entity. 
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Chatteijee-snotebookwou]doftencontainoriginaldata.e.g.,photographsof gels. When 

data was included in notebook pages that I witnessed, I would review the data together with the 

remaining content of the pages. 

THAT, prior to October 17, 1994, 1 was aware that Deb Chatterjee had worked on a 
research project relating to mutant DNA polymerases. I was aware of this research project from 
various sources. These sources included informal discussions in the laboratory with Deb 
Chatterjee, discussions I heard between Deb Chatterjee and othei^, and my .^view of Deb 
Chatterjee's notebook, including original data included in that notebook. 

Attached to this Declaration are several Exhibits containing laboratory notebook pages 
recorded by Deb Chatterjee. During the time period these experiments were conducted, as 
indicated on the notebook pages, I saw Deb Chatterjee performing molecular biology 
experiments. The nature of these experiments is consistent with the notebook entries and the 
cloning of a mutant DNA polymerase gene. 

Attached to this Declaration as Exhibit A is a copy of a laboratory notebook page 

recordedbyDebChatterjee.AtthebottomlefthandcomerofthisnotebookpageIrecognizemy 
signature, indicating that I witnessed and understood this page on the date indicated. While the 
date is masked on the attached Exhibit, the date on the original is before October 17. 1994. 
Therefore. I conclude that this notebook page was in existence, and its content was 
communicated to me, before October 17, 1994. 

The content of Exhibit A relates to a research plan to make Taq DNA polymerase 
mutants. I note that this page indicates that a distinctive difference between T7 polymerase, 
which utilizes dNTP and ddNTP equally well, and Taq polymerase in the presence of a Tyr in 
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lieu of a Phe at a location in the O-helix corresponding to Phe,, of £ coli polymerase I (Klenow 
fragment). This page also indicates to me that Taq DNA polymerase mutants would be made 
based on these differences, in order to construct a Taq polymerase that is more T7-like. e.g.. 
would use dNTPs/ddNTPs equally well. I conclude that this page teaches a plan for making a 
Taq DNA polymerase mutant containing Tyr. in lieu of Phe at a position corresponding to the 
Phe,, of E. coli polymerase I (Klenow fragment). Thus, the concept of this mutation had been 
communicated to me prior to October 17, 1994. 

Attached to this Declaration as Exhibit B is a copy of two laboratory notebook pages 
recorded by Deb Chatterjee. At the bottom lefthand comer of the notebook page I recognize my 
signature, indicating that I witnessed and understood the page on the date indicated. While the 
date is masked on the attached Exhibit, the date on the original beside my signature is before 
October 17, 1994. Therefore, I conclude that these notebook pages were in existence, and their 
content was communicated to me before October 17, 1994. 

The content of Exhibit B relates, among other things, to Taq mutant F667Y. The 
nomenclature ■•F667Y" refers to a substitution of Tyr (Y) for Phe (F) at position 667 of the Taq 
polymerase. This substitution is the same substitution referred to in Exhibit A. However, in this 
Exhibit the exact amino acid residue of Taq polymerase, instead of the corresponding E. coli 
polymerase I (Klenow fragment) residue, is exphcitly identified. Thus, the concept of a specific 
mutation, Phe«,, -> Tyr^^ of Taq polymerase, was conveyed to me prior to October 17, 1994. 

Exhibit B also describes a mutagenesis experiment that was actually performed. In this 
experiment, oligonucleotide 2680 was annealed to a single-stranded DNA containing the 
wild-type Taq polymerase gene at 70°C for 2 minutes followed by cooling to about 35«C over 
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30-40 minutes. The resulting hybridized molecule was incubated with T7 polymerase and T4 
ligase to synthesize and ligate the strand complementary to the single-stranded DNA using 
oligonucleotide 2680 as a primer, and then used to transform bacteria. DNA was then isolated 
from the transformed bacteria. This DNA was assayed for the presence of an additional Asel 
restriction site derived from the oligonucleotide sequence. The presence of this additional 
restriction site would indicate incorporation of the oligonucleotide sequence, containing the 
F667 Y mutation, into the mutagenized clone. Page 2 of Exhibit B contains original data showing 
the results of an assay of various mutagenized clones for the presence of the additional 590 bp 
Asel restriction fragment, which would indicate the presence of the Asel restriction site derived 
from the oligonucleotide sequence. 

This page of Exhibit B is signed by me as having been witnessed and understood by me 
prior to October 17, 1994. Therefore, I would have reviewed this original data prior to October 
17, 1994. Thus, experiments designed to construct this mutation and test it for the incorporation 
of an additional Asel restriction site were performed prior to October 17, 1994. 

Attached to this Declaration as Exhibit C is a page from Deb Chatterjee's notebook 
depicting an assay of additional clones for the presence of the additional Asel restriction site. I 
have independently reviewed the original data attached to this page of Deb Chatterjee's notebook. 
Based on this independent review, I conclude that clone 8 contains an additional Asel restriction 
site. 

THAT, I have reviewed the oligonucleotide sequence identified as No.2680 on the 
Request for Synthesis of Oligonucleotides, which is attached to this Declaration as Exhibit D. 
I have also reviewed the publication of Lawyer et al., J Biol. Chem. 264(1 1):6427-6437 (1989), 
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which is attached to this Declaration as Exhibit E. In Figure 2 of Lawyer et aL, the DNA 
sequence of the Tag polymerase gene is disclosed. I have compared the DNA sequence in Figure 
2 of the Lawyer article surrounding codon 667 with the sequence of oligonucleotide 2680 of 
Exhibit D to this Declaration. Based on this comparison, I have determined that the two 
sequences are complementary with two exceptions: (1) oligonucleotide 2680 contains a sequence 
complementary to a Tyr codon instead of the Phe codon in the sequence of Figure 2; and (2) 
oligonucleotide 2680 contains a sequence corresponding to an Asel restriction site not found in 
the sequence of Figure 2. 

THAT, I hereby declare that all statements made herein of my own knowledge are true; 
and that all statements made on information and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code; and that such willful false statements may jeopardize the validity of the application 
or any patent issued thereon. 
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DNA Polymerase Gi&iiefi:omTi^^^ '^''"•^ - • ' ' • 



Franc s C. Lawyer^ Susnnnd i ffe^^^^^^^ K&^T^9\h Myambot^j .rt • - : 
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The thermostable properties oi the! I>NA pblj^erffse ; 
a tivlty from Thermus aquaiicus (Tai7)*h&ve cofttribj - 
uted greatly to the yield, spednclty r aUtbteflttibh;, and/ 
utility of the polymerase chkln redction'ttiethdd for 
amplifying DNA« We report thfe^clbniiig and ifejcptebsian*: 
of Tag DNA polymerase in EscHisrichia eoU. From a 
XgtllrTag library we identified a TdgiDNAlfragnifenC \ 
encoding nn epitope of Taq DNA polymerase Via ^nti-. 
b dy probing. The fusion protein from the Xgtll:rag;:. 
candidate selected nn antibody from an anti-QHaa pOr 
lymcruse polyclonal antiserum which reacted with TViq 
polymerase on Western blots..\ytt.uscd the Xgtl 1 clone 
to id ntify Taq polymerase cloiies from a XGh35;Tag 

library. ^•::-:h ^t- --'Uvfift " 

The complete Taq DNA polymerase gene :has.?5^^^^ 
base pairs. From the predicted 852-omin9 : acid se- 
quence of the Taq DNA polymerase gene, Taq DNA 
polym rase has significant similarity to \B. cofi DNA 
polymerase 1. We subcloned and expressed appropriate 
portions of the insert from a XChSS'library candidate 
to yi Id thermostable, active, truncated; or full-lehgth 
forms of the protein in R co/i under control of the /ac 
promoter. 



Taq DNA polymerase (Taq Pol I)' isolated from Thermus 
aquaticus has been shown to be highly useful in the polymer- 
ase chain reaction (PGR) method (I. 2) of amplifying DNA 
fragments (3). The high temperat^ure optimu.m. activity, 75 "C. 
aff rds unique advanUges when comparing Taq -Pol I to 
Escherichia coli DNA polymerase L High specificity of primer 
binding at the elevated temperature gives . a highec yield of 
the desired product with less nonspecific amplification prod- 
uct. Also. E, coil DNA polymerase I is inactivated at 93-95 'G, 
the temperature range required to denature tjbe duplex DNA 
product. Since Taq Pol I is stable at 93-95 'C. one can add 

• The costs of publication of this article were defrayed in part-by 
the payment of page charges. This article must therefore be hereby 
marked **adu€rtisem€nt" in accordance with 18 U.S.C. Section 1734 
solely to indicate this fact. . j 

The nucleotide sequencefs) reported ir this paper has been submitted 
to the GenHank'^IEMin. Data Ra.ik with accession numbcr{s) 

I To whom correspondence and reprint requests, a Wnild -.be .-ad- 
dressed: UOO 53rd St.. Emeryville, CA 94608... : :clc - 

* The abbreviations used are: Taq Pol l, DNAcpolymerase isolated 
from r. aquaticus; kb. kilohase(s);bp, base pairs; SDS, sodium dode.cyl 
sulfate; PAGE, polyacrylamide gel electrophoresis; dNTP; deojcyri-; 
bonucleotide triphosphate; kDa. kilodalton; X-Gal; 5-bromo^^chloro- 
3-indolyl-/3-D.galactoside; IPTG. isopropyl-l-thto-]0-O-galactopyran- 
ostdc; PBS. phosphale buffered saline; TM.B, 3.3'i5.5'.nelramelhyl 
benzidine; PGR. polymerase chain reaction; Pol I.-DNA-fxiJymer- 
ase I. *■ * ' " *■ 



Taq Pol t only *A the beiffihfeg 6f th§ PCK Reaction' rather 
than before ekcTi'ro ^ ' " " ' ' ' - 

•'• A62-eS^-kDaT*aqI*6lIhaSbee^ - 
but grbwirtgr-fli^' brgariisiii IS more dHfifciflt than -coil and 
bolymeUW-ja^ias'^are^Bw S)V-Wh^f;^^^^ 
alternative purification protocol**yieldin|p|^ 
with l0-2d times higKfifr ipfeHFic ^li^HCy than that pjfevidusly- 
reported.*While the activity y{eld1s-iq[uite higK^(4G^^ 
initial expression level of Ta^ DNA polym^ in the native 
host is quite Idw (0.01-0;02% of total protein). Therefore, we 
sought toclone the Taq Pol I geneahd express the gene in E, 
coli In addition, the availability of the enzyme and the DNA 
sequence of -the Ta<7 DNA polymerase gene will faciliUte the 
study of stmctuh/function rfl and permit detailed 

■clompari^ns'Wrth me^^^ ; ' " 

MATERIALS AND METHODS'. 
RESULTS. 

X^i/l Li6>ari€i--The construction of three XgtllrTaa H- . - 
braries is described under "Materials and Methods," in the 
Minipririt, Tp maximize the probability of recovering a Taq 
Pol I epitope, three separate A /ul libraries were prepared. We 
ligated 8 mer. lO-mer. and l2-mer EcqRI linkers to the Taq 
Alul DNA fragments to ensure that each Alul fragment would . 
be in frame with respect to /J-galactosidase in one of the 
libraries. Upon screehihg with primary antibody from Taq 
Pol I-immunized rabbits and plaqufe'purification, we identified 
seven positive plaques from the I2^mer librarVi four positive 
plaques from the 10-mer library, and ho positive plaques from 
the 8-mer library. The EtoRl inserts fell into four size classes: 
two of the seven phage isolated from the I2-iier'library and 
two of the four phage isolated ffwid the 10-mer -library con-" 
Uined ll5-bp inserts, five clones from the 12-mer library had 
inserts of 175 bp (one of these also had a second apparently 
unrelated EcoRl fragment of 185 bp)v one clone frora the lO. 
mer library had 1a 125-bp insert, 'iVid ohe cloive from the 10^. 
mer library had a 160-bp insert Upon ^tibody. screening 
each of the phage reacted with Immune serUto but did not 
react with preimmune serum. "Pflabeled::t)robeja- were pre-, 
pared by PCH amplification (3)io£jDne;Cl.on.e-:each.fepm. the = :, 
115-. 175% and 125-bp size classes: The H5rbpi)robe hybrid- 
ized with all the candidates containing ll5-bp:inserts and no 
: others; Simifar1y^heI76-b'p probe hybridized withcandidates 
;. x6nta»ning..l75.-bp .insertsi.ahd the 126:.bp probe hybridized 

- .*b Gcifand and S..Stoffel, manuscript in preparation. . .. 

Portions of this paper .(including ^'Materials anil Methods," Table 
V and; Fig. 8> are presented in minlprint at the end of this paper.^ 
Miniprint is easily read with the aid of a standard magnifying glass. 
Full site photocopies are included in the* microfilm: edition of thc: 
Journal that is available from Waverly. Press.- 
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with only the candidate containinS^iat insert Subsequent' 
DNA sequencing of two 116*bp £coRI inserts, ne each from 
the 12-iner and lO-mer libraries, confirmed that they were 
identical sequences. DNA sequence analysis of Taq and flank* 
ing facZ DNA for the candidate from the l2-roer library 
indicated the presence of one EcoEt linker at its €' (ocZ 



the'mcrk or mcr^^^./icUon systems (6). The amplified 
library was subsequently plated on E. co(t strain MClOOO. 

Nine candidates were isolated and purified from the 
XCh36:ToQ libraiy. From resUiction analysis of mini DNA 
preparations, none, f thex«n<UdatS8.prqY§4.lo.berl^ 
though they all' snared some common restriction fragments. 



junction. DNA sequence analysis of the Taq and flanking facZ. . Upon Southern Wotting, the pTZl?B;l piobe hybridizedia a 

- - . T .^^ comihori 4;24tb ^omHIfragment and a common 6;6-kb Pi«r 

fragment iil all the candidates, consistent with the hybridi- 
xation seenin So&lherh blots of Taq-gen'ol^^^^^ 
For i/mdlll, the probe hybridited to fragments of different 
si^s, ranging in.sire from 6.6 to 10 kb. In additip^aU^iae 
candidates shared a cominon ^J^kb lfCfidin;fragmei^^^ '•; 

One candidate, designated ^-2, had a probe-hybridizing 
Hindlll fragment of approximately SJchrwbich corresponded 
to (He Hindlll fragment that hyb^idi^d with probe 1 in the 
Taq genomic Southern (Fig. 3). We chose this candidate for 
further study andsubcloped each of its four.dcte<*able HindllV 
fragments (A « 8 kb, B « 4.6 kb, C « 0.8 kb, and D « 0,6 kbT 
into vector BSM13* in both orientations, transforming into 



DNA for the ll6-bp candidate fWm the lO-iri^f llbraiy indi 
cated Uie presence of three £coRI linkers at the 6' (acZ 
junction, which resulted in the same fjrame. with respect to fi- 
galactosidase as that of the ]:2-mer linker candidate. Thus, 
we picked DNA fragments encoding the same epitope frpm 
tw libraries. . , . . /^ .-s. -v-r^ 

Lysogens were made of all the candidates in strain Y1089 
and were induced with isopropyl-l-thio-tf-D-galactopyrano- 
side (IPTG). Total proteins from cnidci lyiates of indir<Sed 
cultures were run on SDS-PAGE gels, and Western blots were 
prepared by using the anti^Taq Pol I antibody^fpr detection^ 
All of the clones made IPTC-inducible. iocZ^(usion proteins 
which reacted with the anti-Taq Pol I anUbody (data not 



shown) hostDQ9d,The.t!wo.suh£loneaotf^^ 
One clone each from th^ 116-; i25-,fc,~arid'^^^^^ pFC82.35 ^nd pmz:2, wefe IPTG^mduced and ci- ' 



tracts were assayed for Taq Pol I acUvity^^^ 
pFC82.35 had IPTG- inducible thermostable activity at a very 
low level, which was detectable because of the high sensitivity 
of the assay (<f molecule/lO. cell, equivalents). In contrast^ 
pFC82.2 had a significantly lower basal level of Taq Pol I 
activity which was attenuated in extracts of IPTG-grown 
cultures: ' v .--.vv---;-^ "v .;:- : 

A restriction tiiap of the A fragment was generated and is 
shown in Fig. 4. Southern analysis showed. that the Xgtll 1 
probe hybridized at one end of the A fragment. Indeed, the 
DNA sequence of the Alul genomic fragment isolated in Xgtll 
1 corresponds to nucleotides 619-720 in the Taq Pol" I gene 
(Fig. 2), Further; the \EcoRI-adapted Alul site at the junction 
between E. colt lacZ and Taq in Xgtll 1 corresponds to the lac 
promoter-proximal Ta/j Hmdill site in pFC82.35. ' 

Deletions in the A Fragment to Localize the faq Pol Gene- 
Two different deletions were made in the A fragment in 
pFC82.35 to aid in localizing the geheV Ih pFC84, approxi- 
mately 2.4 kb of the right end of the A fragment was deleted 
from the Sphl site (Fig. 4) ^ightw^^d to the 5/}/iI site. in. the 
Vector polylinker. In pFC85, approximately 5.2 kb of the right 
end of the A fragment was deleted from the y\.^p718 site 
rightward (Fig. 4) to the Asp718 site in the vector poly linkers- 
leaving 2,8 kb of TcM? insert sequence. The activity of Taq Pol 
I was assayed in extracts of uninduced and IPTG-induccd 
pFCS4-"Bnd pFC85 in DCJlOL As- can be seen in Table X 
deleting 3' sequences in t.he A fragment had a dramatic effect 
on the IPTG-inducible expression of TaajPol 1^ In addition^., 
^hile We Were unable to detect Taq Pol fin WestemiBlbtsV 
IPT(J-induced pFC82.35/DG98, indiTced immunoreactive 
bands were clearly seen upon Western blojting. o( IPTQ;./ 
induced pFC84/DGl01 and pFC86/D)3i6l (Fig. 6). In the 
Western blote, induced pFC:84/pGl01 and pFC85/DG101 
lanes, revealed doublet immunoreaetiye t^ands thleyrjwere ap^ ' 
proximately 65- and 63-kDa, These immunoreactive species 
were considerably smaller than full-length 94-kDa Taq Pol I. 
Fig. I. Immunoblots with affinity-purified antibodies pr^^^ We determined that the doublet bands we r^not irtifaeU 6f:: 

^ru^illtehtS^^^^^^^ the gel analysis because they were seen repeaUdly in several 

Taq Pol I (partially protcolyred) plus 10 /ig of gelatin were loaded on. . experiments... .... • : 

Lane A, and 10 ^ig of Taq crude extract was loaded on Lane B. LocZor fas/on5— To define further the focus of the Taq Pol 

Antibodies used to probe immOnoblots were: i, 1:10,000 dilution of | g^j^^ ^ confirm the reading frame at different sites for 
the anU-Taq Pol I polyclonal antisenim; 2, anti-Taq Pol I anUlwdy guideposU during DNA sequence analysis, we con-' , 

affinity purified with purified /»-galact^^ ltV.?^">td^ structed several fusions of the left end of the Tag HmdHI A ' 

anU-Taq Pol I antibodies affmity punfted witb eiUacta of mduced swuv-wru »^ , - . oox>fio* .f^. ti.«.A.c;^«c ar<> 

Xgtll clones 1. 3. 9, and 2-11, respectively. fragment to hcZa in the BSM13* vector. These fusions arc 



size classes was chosen for epitope selection. This method 
uses crude extracts of candidate clones to select antibodies 
from a polyclonal antiserum. These affinity-selected antibod- 
ies were used to probe Western blots of Taq Pol I. The results 
are shown in Fig. 1. In two experiments candidate Xgtll 1, 
the 115-bp insert candidate, was the only one of the four 
tested which successfuUy bound antibody;^ 
purified Taq Pol I and reacted uniqiicfy with Taq Pol I m 
crude extracts. The other three candidates, which had been 
identified and purified with the anti-Taq Pol 1 antibody; failed 
to "fish" from that same polyclonal antibody an antibody that 
would react with Taq Pol I on a Western blot. A close 
inspection of the Western blot indicates a faint cross-reaction 
with 28-30-kDa proteins in total soluble Thermos crude ex- 
tracts. The DNA sequences of these three candidates do not 
correspond to any part of the Taq Pol I DInIA sequence (Fig. 
2). 

XCh35 Libraries— The 116-bp E(fyRl fragment from done 
Xgtll 1 was subcloned into Genescribe Z vector pTZ19R to 
use as a probe in screening the \Ch3b:Taq library. Construc- 
tion of the partial SauSA digest library of Taq DNA iri XCh35 
and screening of the library are detailed under "Materials and 
Methods," in the Miniprint. The in uifro packaged library was 
plated initially on E. coli strain K802. That strain was chosen 
. to avoid the possibility of degradation of Taq insert DNA by 
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-UO XX«:TCACArCIXCCT<WCTCJWCCCCTCCCCTTCCACCfOW^ 



JktduiCCXOCTGGICCACCrCCfCtMGCTCGC^^ 



Hindus 

€0 1 CACXXGACCCCGXCGMBirCTGGJ^aXI 

CluLysThrAUArqLvsUeuLeuGlfia^uT. . . 



C^JAUCiyGluArgAUAlaLeu£c^GluArg^eu?heAlaAsnLel:T^pCly/.:9t•eJ^'lu^lyu»ul>I«/^r^L.c . . 

■ .... 
teuAlaHis«etGluAlaTrirClyVdlArqLeuAspVAlAlaTyrLeuAcgAiaL€uSerL€uCiu/a.AiaoiuK,*ui*^ ^ .. . ^ , 

u.. ccc.TC*.ccT«xcrccccc=*c4^p^occ^^ . 

l'ryiM;»:A:::iI.ouAanSorAr<jAjp-;!ra.ouulqAr'jVall.cul'heAspOij*.tiu>*i,i.oLi rcAiJ.ic^i,! ,^ y r i ^ 

Pst : Sac: . . . 

IS6. .CCCCTCCCC«i«CCC«CCci.TCCTCOXC^«TC«-in*CCC««^ . 



AlaL€uArgGluAlaHis?rolleValGluLysUel.euClnTyeArqGluLeJihr»-ysi-euLvs:>er-nr.,r,. p k ^ ' ^ 

■ ' Bamwr - " * 

.s.-': ssct • • ^ 

leOl ixCGA<^(n^ATTCCTicCCC^CCACTATAC^^ " 
CloClvGlyTipLeuUuValAl-aLcuAspTyrSerClnrieOluLeuArgVaiUuAUHisLcuSerClyAspGluAsnU^ 

1921 CACACC<W^»«CXT«TTC<WCcWcCCcij«^^ .. 

2161 ^GACCCTCTicCCCCCCCcicaCTACCTcicXCACCTA^^^ 

CluThcLe-jlhcClyArgArqAcgTyryal^roAseLeuGluAUArgyalLysSerValArQ^ ■:. 

2*01 ccccTC<;ccyJiGexc«cxw<Ac«S<wtc«TCCC<alooK^^ 

At^l.euAl-LysCluV«lM...tOlu..iyV*lTyrPtoUvM\l*Vdli;r^Lcu3luV.;iiuV4l^ 

FiC. 2. DNA Bequcnce and dsducei amino acid sequence of Ihe' Taq Poj I gene. Nucleotides were 
numbered consecutiyelyjrpra the start of tl^^^ f . V 

arc shown on the r/fihf.. • . i. - . ■ = ■= • •■■ = •• 
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12 3 4 



14.3- 




Qr 




6.7- 


M . 


4.7- 




3.8- 




3.2- 




2.8- 




2.6- 


• • • 


1.92- 




1.1- 






! . ..Firu f llcalrlcClon mnpHof DNA f rn((mc nU ediiiij In lri|;i he 

1 .^aq.P 1 I g^nfr.^. ^he 4.6;kb //md|l(.B fragment ood Ibo jJ.O-kb 

: -iS/mdIII A rragmenl. Restriaiori alUa are? ft/haiir(]^, 

; 'BamHl Xfla);^^ {Bg):A$pfimAsS, and 5plir (S;>}> B/feifpartsIon ^ 

j aKowtrig the ToqrPol I cddlnrregibA {bQldiUnty, Arhiu^{^'^^^i^ • 

. N.tenniiiuft of tlrpiiene. Dotted line - -) indicalea >«^lM;aecRteiice 

' Keatriction aheM^ as ahova^and HstKll {Hsh.XhoUXkU Jigil ih 



Fig. 3. Southern blot analysis of T^^ 
with a-"P-Labeled PCR*ainpiirted probd."£ane i is a 9ize sShB-. 
ard EcoRI- and Bam Hl-digestecl XpIacS and Afs/ir-digested plasriid 
Lac5. DNA fragment sizes (in kilobascs) are listed at fe/f..The PCR- 
amptified probe contains the AgtU primer sequences on either end 
(flanking the £coR[ site xnlacZ) which are homologous to sequences" 
in the 14,300 and 6J00 marker bands. Lanes 2~S are Toq genomic 
DNA digested with //indlll. /fiVidlll and P$t\, Psti, Pstl and BomHI, 
and BamHlt respectively. 

" Table I 
Taq DNA polymerase activity in E. coti extracts 



— • .:. -1 ■ L„3.4. 5 a -7- e.-.;«— .* * 

- " :;ctlvi-., NMv\|v:n 

. . _ ••^'i:y-:-^,...-r- ;:»r;^ 



f>«i?A-v-in »rj Urn /V. coil h 



Expertmeat 


Extract 


iPTG 


Specific 
activity' 




I 


BSM13" 


± 


<0.00l* 






BSMl3*w/Tnq' 
BSMl3*w/Taq^ 


+ 


0.142 








0.136 






pFC82.35 




0.248 
-0.310 






pFC82.2 




: 0.031 










0.002 




H 


BSM13* 




0.003' 


■ 




pFC84 




1.24 

:29,7 






pFC85 


+ 


. 0.87 
29.6 






pLSGl 




4.4 
; 37.5 





* Specific activity in units/mg total crude extract protein when 
assayed, as described under "Materials and Methods,** on clarified, 
heat-treated extracts. ; _ 

* A background of 0.da4%;'input countk has tteeh subtracted Ex^ 
tract protein corresponding to 3 x 10' cells was assayed. * ' 

* Purified Taq DNA polymerase was added to a replicate cell pellet 
at time .of lysis. The assay contained 4 x 10' molecules of Taq Pol I. 

./Purified Taq Pol I, corresponding to t X iti! molecules, was. 
iadmixed with the BSM13* extract at timeof fl^y.V . . 

'A background of 0.002% input colints Has biSen. 'subtracted. ' 
BSM 13* specific activity represents two' times "background. 

described under "Materiafs and Methd^^^^^ are suninfid-' ^ 
rized in Table IL U^ing these fusion^ we" determined the* V 
reading frame of Tail Pol I at the' Nhel site at nucleotide • 
2043, the BamHl site At hucleotlde. 1780. and at four locations^; 
at or leftward of the Xhol site at nucleotide 1408. * ' " 

Assembling the Fuil-lefigth Taq PoVrGehe—ki Aeicn 
above, the Sphl and Asp7l8 dele.tanl^, pFC84 arid pFC85, 
produced thermostable polymerase iacUviiy upon induction.' 
However, the size of the induced baqd^^ ahti-1|aq . ; 

Pol I antibody in Western blots v/as smaller' tha \ 



. FiC. .5. Western blot analysis of Taq Pol J clones. Cultures oX 
Xaq Pol I clones were indyced.with lEX(5 ,ai iifc§cribXdiund.erltM9^ 
terials and Methods.*' Utyntfulffd and ^ samples wVr^'l^^^lVzed 

on SDS-PAGE gels and^subjei^edlo \V^ekem bfo^ 
described under "Mat erials'and Methods: •• /;a>ie /, Pharmacia lovi.- 
molecular weight marker: Molecular ?weights:( in thou!umd.s) are shown 
at left. Lane 2. induced USM13* (3:j pg) ncsnlive control. IxinpR .7 and 
4, uninduccd (0.04 unit, 33 /ig) antl induced (j.O unij, 33 pg) jiFCKTi/ . 
l^ncs 5 and fi, unindiiccil [OA}:} unit. 33 pg) nnd induced (l.O unit, Xi 
pg) pK('B-l. I^ncs 7 and «, unindtited (O.O.S unit, I T Vg) nhtt ImlifrbU 
(0.4 unit, 11 |ig) pLSGl. Lane BSM13'' (33 ,ig) plus 0.4 unit of 
purined Taq Pol L * 

T^q Pol I, i.e. approximaVely 6/j:kDa as ojpposfed tb" full- length 
:a4-kDa. Thus, we fell that the A: frrig^mentllackcd- the 
portion of the gene which \yould encode the N. termipus, 

Also mentioned earlier, all candidates frbrri the ACh35 
library which had beeh identified with the pT2fl9R: l^Vobe 
shared a common, approximately 4.5!-kb. /^mdlll fragment 
which did not hybridize to the probe. This Xrag^iheht^ the B 
fragment, was subcloned into BSMlS*, yielding plasmid 
pFC83iTho restriction map of the B frag:ment.was idelerrhined ' : 
(Fig. 4)> By comparrng' those mapping i-es^^^ 
fragment map with the results of Taq genomic Souiherh Blots 
VvbiitfiWithprdKl (Fi^5) wedeaucedthA ; 
B was likely to contain the ff', portion o( the TaiPbl'f genp.' 

The 724-bp fig/II-i/mcllli ^segnient of the fi'H^erit: was * 
60b<*loneid Into flomHI- ;and //indlllKii'^bsj^d^^^^ 
sequencing, an' ATG and subsequent opien readihg frame was ' 



was identical toUhe phase of.thie open* read ini^'ftkmy alJ the 
*'Ua"ehd*oflhe^Afragm^^^^^ • ; - - =-*vr. -.viv^i!.), - 
PCR' Amplification "coWfi^ the B dfid A* fragments 

in 'pFC?83 andpFC82:35 are contiguous in the ^a</-^en6me; 
Primers were chosen: vtffiich' ftanlied ttv6 presumed IrtUi'rtal " 
'//i/?dnrsi>c: MK138'(T6W in the-Miniprin/); :h t^^^^ 
siH6' oi Hin^lW and Fp25^.a 20-mer corfiplemenfary t; 'ifiucle- 
'otid^s 622-641 of the Tacj^P^ I ^?quenceVon side of 

//iridHi. Upon nfnplificnlion (3) of the XCh.isrgcfiomic phrig^ 



Fuxion* 



pticnutyiie' 



.FMlonONA 



:T»q. 



ANhe I 
Alia 15 
ADa 33 
ABa 35 
AXh 28 
AXho 30 
AXho 32 
AXho S3 
AXho 54 
AXh 59 



Olue 
White 
Blue 
Blue 
White 
White 
Blue 
Blue 
Blue 
Blue 



ri\%0 €7A G A lOG. ACC. TCG 



dqtionjind Expmsxon of Taq Pol I Gen^^ : 5431 

. . : V'** frame of a G + 6 wntentV67,9S6^ The DNA 

' - , AAGG j-p throui^ -^^6. Fig:;2) is compleipentary to 

*..: *i vs -^K®:^' Thcrmus 4hvirmophilu3 16 S rRMA 
. ; -i ■ 1 1. ii\ knimay. comprise a portion of thfe -ribosome binding site 
. - ^: for intti^ion of translation at the nrst ATGi 



DISCUSSION 



GAGACCZAr I TWC CCg gTA, .. . , . . - t:.., ... »"«'vuooiuri 

ccc £AG: AgCrd!E I ifiiT-iSffi SSS- SA.' : : . /Several groups have reported the cloning and expression in 



CT*G^AfCT f ^c5s^A5p*g5^ ;ot.<m<//i) /tam^TOe/7ff<^/faau^,(«, /J-isopropylmal^te dehydro- 
AmcTT C^ gCG V 5TAnc€dt75G"?5;- ' genasfe.(fct/B).from. TVicrmiw thermphUttsXBh tind the*9\ial 
CAA Kc. err ccc ^ ota^^ acc tcq ^restrictlort-modifieatioh system from TOcrmtw aquaticUs tlO) 
GAG GOG ccc r cCG'^ JW' AAT^! ! lijima effat W) selerted the mdh gehk tiom a T, /buus bftrtfal 
-- CAG gcgc^cgg t^gMSg.AgSigg: HMllldihTatf. iji pBm2 'by scr^eiiing crude extract 6f 

• Construction of fusions between J5' sequeoceajot tbe-TaqPot J ; pools of independent library trihsforintints at 60 'C for mcQate 
Melholk ^ fragment and tqcZa is j|escrit>ed.yrider ^*Mateti^ Na|;iihari k hL (9) :sel^6t6d directly 



I^The lacZa phenotype was deterroined. on 




X>GaL In-frame fusions resuK^<|"i(i blUe'fcol^Su 

of-frame fusions yielded whitVic<>Jo"nlesr • - -i t ^ '™-\7r;|»^*'*'V^*,yiu»^fi w^u^a compje- 

'The DNA sequence was^dfeferminedPkttfe^hSite of «5cKTuston:'^= rented a»7et^B'imitation in the E, coti host Slatko.et aL (10) 
nSMia* polylinkersequenccis shown io^the^ght of '(helK}ld1ihV< for 'expression'of I^I toelhylase in 

OroupinKs of three nucleotides indicatejtfe.reading frame.of tocZci. T<jq:pBRi22 libraries. However, Tool endonuclease aDnearpH 
Taq DNA sequence is shown to the left of lhe.bold line, for in-frame . ryfittn hP s^rtlvA f^t ^7 T ih 7^ >^fr^-«Ar^-^l " .^^^ 
(blue) fusions, the deduced frame "of theTaq Pol I genJis indicated: " ^O*?^ -active St 37 C in K CoU, Since dones with only the 
Hcstriction sites regenerated; (N/icI. BamHl) or generated {CtaH are restriction gene were viable m the absenc$r of inodification^ 
indicated by italics. The Taq Pol I nucleotide coordinates' (Fig, 2) at 
the fusion sites of the XAol /dcZa 1 
30, 962; AXho 32. 1266; AXRo'^G 
1050. 



I ..uc.cui.uc tvuru«mi« irig, ^, tti -Seversr gT6ups*haVe also reported cloning and expression 
fcisiohs are: iXho 28» Mil; AXho of DNA polymerases in E, colL Kelley e( oL (11)' cloned the 

^^ansduclngpTlage at a leVel orapproximately 4% ceff 
protein: HoWevBr, tWy were unaWf tb maintaiV^^^^^ 

• harbonhg= the PoIA;** gene; probably bec9U5^ bverpVbduction 
of Pol l ih^R 6£rfi is Fetlial to th6 cell: More recently. T4 DNA 
polymerase has been donejd and expressed in E. coli (12). In 
this case/it was necessary to clone the gene under control of 
inducible promoters such that constitutive expression of the 
gene would be minhnal. Attempts, to clone the gene under 
control of its own promoter in JS. coli were unsuccessful 

- probably because of the detrimental effect the polymerase had 
Oh the cell. We did not know if T^ij Pol r would be toxic to 
E: cb/i cells at 37 *C. White the ifi utfro' specific activity of 
Taq Pbl I at 37 'C is only a few percent of ithe specific activity 
at 75 we cotild not predict if the DNA binding activity of 
the enzyme might interfere with . normal cell £un^^ To 
avoid potential problems related direcVexpression of the 
gene in K ccli we chose to clone an epitope of the Taq Pol I 
gene by using Xgtll libraries and antibody selection. The 
epitope-expressing clone was subsequently used to select tbe . 
entire Taq Pol l gene from a library in XCh35. * 

We were *u nable to .detect 'ia thlirmostaUe'pbljnrnera activ- 
ity in cells infected (11) with inj^p^^^^ includ-. 
ing 4>i'2. The polymerase , assay; Is >xti:emely'.8^^ and 
can detect r molecule of polymerase, per W 
-Upon subcloning of the 8-kb pi-bbe-hyt^^^^^^ HmdIII A 

• ffaigment from ^4:2 into BSM13* aK^ of the 
subclone pFCS2M, a low I^yerbr Iherrao^^^ pilymerase 
activity was detected (Tk'ble I)J;fi^sed on th^ activity of 
purified Taq Pol 1 when adipixed with JE. coa'.ceUs, this actiyity. 
represenU two to three mojecu^^^^^ Pol. I per cell 




Fig. 6. Plasmid pLSGi; The 6.58 kb jila&mid contains a 3:41- 
kb segment of T. aquaticus DNA in a derivative of the plasmid vector 
BSM 13*. The bold line indicates^the 2^-kb Taq Pol { coding sequence. 
Expression of Taq Pol I is controlled by . t.he.fac promoter/operator. 
Construction of the plasmid is described under "Materials and. Mj^^b- 
ods." Restriction sites are as 'in legend to Fi^. ^ ^nd iTcoRI {£) fahd 



we observed the predicted PGR product of 86 bp (data 
not shown), indicating that the B and A/ffmdlll fragments 
are contiguous, A larger PGR product would -have indicated 
that there was another Hindlll fragment in the gene. 

.The assembly of the full-length .Taq Tol I gfe'ne -in^ plasmid . 
pl.SGl (Fig. 6) is described under "Materials and Methods." 
Cultures of pLSGl in DGlOl produced lPTG-indiicible ther- 
mostable polymerase activity at 37.6 units/mg protein (Table 



I). Western blots of IPTG-indCiced pLSGl/DGlOl cultures - equivalent. The gene was'jpcalize^i to of the 8.0-kb 



revealed an immunoreactive band 6f appr ximately the same 
size as full-length Taq Pol I,.94rkbi (Fig!, 5). Ck)omassie 
staining of IPTG-induced pLS'Gi/DGlOl cultures failed to 
indicate the presence of a detectable induced band The com- 
plete nucleotide sequence of the Taq P'oi^ T gene and the 
deduced amino acid sequence are presented in Fig. 2. The 
DNA sequence of the Taq Pol I gene predicts an open reading 



Hindlll A fragment by u^ing deletion analysis/ .Upon IPTG 
induction, pFC^. the Sp/il deletion, and pFC85, the AspllS 
deletion, yielded a 100-fold increase m .Taq' PoV I activity 
(Table I) compared to that .of; the full-length A fragment 
subclone, pFC82.35. This inc;:ease.mactivi^ allow ady . 

detection of the induced proteinCs) on Western blot (Fig. 5). 
The A fragment induced proteins were truncated with an ■ 
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apparent molecular mass f63-^„ 

Fusing the 6' Hinilll site in the A fragment with the 
HtndlU site in BSM13* causes the Taq i*ol I gene to be out 
of frame with respect to fl-galactosidase. The reading: phase 
at the HindlU site in BSMlS^ with respect to /?-galactosidase 
is A AGC TT, a frame of "O" (13), The reading frame of Taq 
P 1 1 at the Hwdlll site is AAG CTT ("plus D. The ftision 
gives rise to a minus 1 frame shifL In. the /J-galactpsidase 
reading frame, there is a TGA stop codon at nucleotide 1478 
of Taq Pol L Downstream of this TGA there are several 
possibilities for restarte which could result in truncated forms 
of Taq Pol I: ATGs at nucleotides 1609- apd 1752 khd GTGs 
at nucleotides 1647, 1669, 1722, and 1731. Iri fact, we see a 
* doublet in induced lanes of both pFC»4 and pFCSS on West- 
cm blots (Fig. 5) indicating at least two reinitiation sites. All 
but one of the likely sites, the ATG at nucleotide 1509, would 
probably require a ribosbme binding'site for' reinitiation. 
There are reasonable ribosome binding sites for the GTG at 
nucleotide 1722 and for the ATG at nucleotide 1762. Trans- 
lation initiating at these sites would yield proteins of 59 and 
68 kDa, respectively. However, the apparent 'molecular 6iasses 
f the doublet bands seen on Western blote of pFC84 and 
pFC85 are approximately 65 and 63 kDa, based on comparison 
of the mobilities of the doublet bands with the molecuiar 
weight size standards. Whether the result of reinitiation or 
pr teolytic processing, the thermostable, enzymatically ac- 
tive, truncated forms of Taq Pol I directed by plasmids pFC84 
and pFC85 (Table I) suggest that significant portions of the 
Taq Pol I sequence are riot essential for DNA polymerase, 
activity. 

The purpose of the set 6f fusions of 5' portions of the Taq 
Pol I A fragment with lacZa in BSMI3* was to confirm or 
determine the reading phase of the Taq Pol I gene internally 
as an aide to nucleotide sequencing. Since we knew the reading 
phase of lacZ in the BSM13'*" polylfnke^, we could infer the 
reading phase of Taq Pol I in a-complementing in-frame 
fusions. DG98 harboring fusions which were in-frame were 
readily detectable as blue colonies on X-Gal indicator plates. 
We generated a series of fusions (Table II) at nine sites 
between nucleotides 962 and 1782 of the Taq Pol I gene. 

We compared the DNA sequence ofTTaq Pol I with that of 
E. coli DNA polymerase I. At the DNA level; the two genes 
lack any significant regions of homology (Table III). In regions 
where the amino acid sequences are homologous, the DNA 
sequences diverge, especially in third positions of codons. The 
longest stretch of DNA sequence identity is 19. bases (Table 

ni). 

The predicted amino acid sequence of Taq Pol l is shown 
in Fig. 2. From this a codon bias table was. generated (Table 
IV). There is a heavy bias toward G and.C in the tKifd position 
(91.8% C and G) as would be expected fe^ir GCTn'c'h' ^^^^ 

Table III 



Codon u^age in 



%tht1 



Arg 

|76) 



Thr 

(30) 



Ala 
(91) 



Asn 
(12) 
His 
(18) 

-Asp^' 

(16) 



CGt 

CGC:,^ (124) 
0 

. 0 ; 

'0 
20 
10 
0 
*2 
11 
12 
0, 
0 
12 
0 
18 
"*3 



TABLgVlV :r., 
f, aquaticus DNA polymerme I ern* 



CGG 
CGA 
ACQ 
AGA 
ACT 
ACC 
ACG 
ACA 

GCt' 
CCfC 
CCG 
GCA 
AAT 
. AAC 
CAT 
CAC 
GAT 

ATG 16 



■Frd 
(48) 



,Gln 
.(16) 
Glu 
(87) 
Phe 
(27) 
• -Trp-. 



TtO 
TTA 
CTT 
CTC 

ere 

CTA 
CCT 
CCC 
CCG 
'\GCA 
•OGT 
• CGC 
GGG 
-GGA 
CAC 
vGAA 
GAG 
GAA 
■ TTT 

. *«JG 



Ser 

:C(3li: 



3 

20 
46 

m (51) 



9 
2 

• 0- 
28 
30- 

.0- 

79 
8 

19 



lle= 
(25) 



TOT . 

Tea.. 
Wg 

TCA 
-ACT , 

■ AGC V 

ore 

GTG 
GTA- 
'ATT • 

ATA 



0 

1 

0 

: 1 
14, 
-Mo: 
21 
29 

izb'-' 

2 



IVr : TAT 

:(24).-r.vTAa 
Cys TGT 
(0) TGC 
L>^ ^AAG 



0 
0 
37 
5 



and as others have observed for other Thernius genes: .95% C 
and G for the ^^24 gene encoding L-lactate dehydrogenase of 
Thermos caldophilus (15), 94.8% for mdh from T, flauus (14), 
and 89% for leuB from T. ihermophilus (16), 
^^ificft»t;^fflLinQ.^$i<l^ 

iraSe: One^ possible seqiierice. alignment yields 38%- identity ■ 
between the Taq Pol I and K coli Pol I artiinb acid sequences 
(Fig. 7). There are two major regions of taq Pbl I And one 
region of T7 DNA polymerase that show extensive sequence 
similarity compared 1o E. coli Pol I. The fi ret region of Taq 
Pol I extends from the N terminus to approximately re.sklue 
300. The second region extends from approxiihately residue 
.410 to the C terminus of taq Pol I. The. N-t^rmjnal region .f ' 
Taq Pol I corresponds to the N-terminal domain of E, coli 
Pol I shown to contain the 5'-3' exonuclease activity (17). 
The C-terminal regions of taq Pol I and T7 DNA polymerase 
correspond to the coli Ppl-I domain showiv to contain DNA . 
polymerase activity (18). The x-ray structure erf the Kienow 
fragment (r9)'shows"that this domain contains a deep cleft 
believed to be responsible for DNA binding. 

Apparently as a result of many mutations^ deletions, inser- 
tions, etc. during evolution," Taq Pol I residues at positions 
300-410 show little sequence similarity compared to E, coli 
Pol l, Taq Pol I is 96 residues shorter than coli Pol I; most 
of the deleted residues occur in the region encompassing 





Sequence 
location' 


Nucleotide 
tdefitity 


_ , ■ ' 

- 'Amino iacki' i; 
identity 


Taq Pol I 
Poll 


190-208 
178-196 . 


19/1? 


6/6 


Taq Po! I 
Poll 


1730-17$7 
2015-2042 


:= 23/28 




Taq Pol I 
Poll 


2260-22Z7 
2545-2502'^ 


. - 17/18 , 




Taq Pol I 
Poll 


2344-23Gaf * 
2636-2654 ^ 


n/20 . 






Nucleotide sequence coordinates for £. coli Pbl I adapted Yroiii 
GenBank. - •■■ - ' ' 



fragmentof£. cob*. Poi I that contains only;residues 515-928 ! 
is devbid df3'-6' exonuclease activity, but still retains polym- 
erase activity (18). Presumably, the E. coli Pol I region com- 
prised of residues 324-515 forms at least part, if not all. of 
the 3'-5' exonuclease activity; Taq Pof Pand £?. cofi.poht* • 
display littfe sequence similarity In t the tpresomptive' ; 
exonuclease region. Of the ^S'co/r Poll f&sidfie^ shoivn to. be • ^ 
involved in cation arid debxynutlcoside monophosphate hind-.^ 
injj; the scquence-arignmenl of Fig. 7 shows only A'ap.^24: ris • 
havingan exact hdmbloff in the Taq Pol. I Sequence. Although . 
Other high scormg= sequence alignments ate possible in the 
taq Pol I 300-^410 region, it is possible that: the-rTaq Pol I 
gene hias undergone key mutations, deleti ris. or insertions 



..r.i.-s 



B. C. 

C. C. 



I HRCMt^tFEPKcfiVLLVDClflllAWTri^ 

» ii liim fMnh Vri III ' rfii 'f"i;".T r iiruh ih Y i in; in nii' n *| •* 



100 KELVDLI^ZARX^VPGTtiUjbVlj^PUCKA^^^^ 

II iMi I'ljijiinnrin Ml 1 / i iijirrnii i in ir i/r iiui U ,11 n ihiiMiiti 

96 MAMVKA«CLPL1^VSCV£W>0VICTIJUUUE*^ " 
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198 




T.«q. 281 ILHEFGLLESPKALCEA^., ...... ■ 

H I IMJ I I , : ^ . : .!> I. ... 1 I Jl.ll HAlirJIrl ^:|1. II :ir . 

B.C. 296 WLQWCGAKPKAKPQEtSV:j)EAPEyTXryiSXDMTn/TIto . 



i>. r 



•351 
..3S5.' 



B.C. 

T.*q. 

E.c. 
T7 



352 LAKDLSVUaftEOLGLPPG. 
396 ISRERALELUCPLLEDEfUUicVG^ 



412 



411 

487 



I III II i .::i fii.m 1 1 ;i 1 ii 1 U;:, niii .11 r 111 11 ri '.iin uiimiim 

488 QIXLEEKGRYAAEDADVTLQLHLKlWpbLQKHkCPU*^ 

.: ■ . . . .11 HmMI 



E.C. 
T7 

T.aq, 

E.c. 

T7 

E.c. 
T7 



492 R\a-FDELGLPAIGkTEKTC(CRSTSJU^\aBj^^ 
Mill MM M»i 1 l-M il'lM*''' 



. V EAHP IVEKI LQYI^L- 

V ' iP^ i i MV' lM 

588 T I LFE KQC I KP LKJCT . PGG AP STSEENO-'EE tJC:\- r . UirP LPKVf LEYRG 




DPLPOLIH .• . PRTCRtHTRffMr 



LKSTYt rPKLPLHIN; 

i .1 1 M 1 1 



• I I I II III ilj llllii/i :t: i;:fi liM' ir , 

343 KKLQE . AGWVPTKYTOK .CAP\nrtlDEVTXC\mypDPEKQJU^ipiil«tEyi>riQKRrGQSXEGOKA>^ 



V» « < I »ii^niiUif»iihj^ lUHHd n^ifhi Arm I 

PKTGSVHTSXHQAVl tlTGMiSsjTDPKLQMIP . 
I lUt.V r iT Ml 1 I Ml Ml II 



58 8 TPLGQRIRJ^AFIAE. ". :.. . . _ EGWLLVMX>VSQIEUlVIJWLSGOEMLIRVFQ^^ 

IIMIIMM:< MMIM*^^lltitMMMIMI I M IMMIM III Ml M < 11 1 ! I I M.|| 
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Fic. 7. Amltio aoid.keqceirce cp£t>&Hscrs^f the- DNA polymerases from T. aquaticua, E, coli, and 
bacteriophage TT.,Deducejd ap:i1no oUd fiequKices for Taq Pol HTioq.), E, coli Pol V{E:c.h arid bacteriophage 
T7 DNA polyn)eraae-(77) we re .analysed for .amino. acj4 sequence, homology by using the computer program GAP 
from the tlniversUy of Wiaconain genetics Computer Group. The alignment waft obtained bV using -the mutatidnal 
data scoring matrix of Staden {lOirViiiicalmQrfU or functional relatednesa between 

pairs of residues In the thfee seq^enc^^; Haf^t^Hfcd/ niafJb denote Btnind'acid identities orfunctrbnal relatedness- 
between residues. in Teq Poll a^uiTTONA poIyuierase^n or A". ^^^^^^^^ , i. -: . ^ 



that have destroyed its 3't5' expnuclease.activity. Preliminary: 
results indicate that Taqlfeoll display?: Tutl^^if.anst, 3' fSV 
exonuclease activity. ' '{ .r- r.. .-.-v.r ,v , ; 

Sequence homology be.iw.ee n : E.: eqU Pol J.and /H.-DN^^ 
polymerase has been previously noted.-Those.T^? P.NA 
erase sequences shown by OlUs .e.Mi/.(21) to .be conserved 
between that enzyme and B, coli Pol I are also present in the - 
Taq Pol I amino acid sequence (Fig. 7). Most Af the conserved . 
residues are found in stnictucal features'tliat form the DNA- 
binding cleft of the enzyme. Although short segments of T7 
DNA polymerase sequence in the 1-334 region are similar to 



regions in E, coli Pol I and Taq P61 I, the overall sequence 

similarity in this region, ighonhg.the first ;300 residues of 
..co/i Pol l and Taq Pol I that form* the 5'-3' exonuclease. 

doni pi Hi is. poor. A complete and unambiguous sequence align- . 
: ment (or .this region cannot be assigned It should be noted 

that although T7.DNA polymerase also shows little similarity 

tp E. coli Pol I ih the region of the 3'-6' exonuclease domain, 
, T7 DNA polymerase is reported to display significant 3 '-5' 

exonuclease activity (22, 23),' 
Berhad et aL (24) and Pizzagalli et aL (25) have identified 

several short regions of DNA polymerase amino acid se- 
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quences that are highly conserved Unserved sequences 
are found in polymerases from. herpes almplex^virv&tjrpe 2, 
human cytomegalovirus, varicella-zoster virus, Epstein-Barr 
virus, vaccinia virus, adenovitu^ 2, killer plasmld from 
KUweromyces fdctts, maize mitochohdrikl parttcle, bacterio- 
phage #29. bacteriophage T4i baicteribiphage PRDl/and yeast 
plasmids. Neither JB. coK Pol I, SPaq Pol I,* nfor h^ctejiophage 
T7 DNA pol^^erase conta^n&^e conserved sequences.npted 
in the polymerases from those. jBourcea. Aside from the ho- 
rn logy between Taq Pol I and either coU Pol I or T7 DNA 
polymerase, 6o aigniricant amino acid sequence Similarity wak 
found when a global homblogir. seaiifth' y^iSs: Aii^^^ cortiparing^ 
Taq Pol I to the National Biomedical Research Foundation's 
amin acid sequence database. ... : ' V 



A more likely ^xplan ^ /or the preference for Arg over Lys 
in thcrmosuble prqteiris wirould seen) to be based on the uniqxte 
physical^heinicai properties of the two.amtnb acids {e.g. pHm[ . 
values,: hydlrogen bonding patterns, hydrophobicity/hydroK 
philicity). ' • • - N 

y-- The trundaled flfnd full-length Taq Pol .1 enzymes produced 
upofi IPTG lnduction show different reactlvitled U> the fihti'-- ' 
. Taq Pol.l ai)Ubodx.,Eor Wcsteni blots (Fig. 5),.the immune^ , ; 
ireaclive band in the lane of induced ptSGl is-more .readfl}f,t! 
' detecrtable than induced ;pFQ84 or pFC&5, the Sphl, arid .\ 
AspilS A fragment deletions. In. fact, we loaded three times. . 
^ as ihuch of'the pFC84 md pFC85 eztractia compared tO' ^ 
pLSGl. and the resMltingpL^G] immuh'oreactiviebilnd is still 
ihdre ^Qt^inse^^ fn'fer that there are more epitopes for our 



Chemical modification and'inacUvation studiw of cofr ..Wti^.P^^^^^ 



Pol I have resulted in the identification of manyiamino acid PAGE gel slice, m the N.terinmal:?nd of Taq Pol I than m 

residues believed to be important or esse^tfal foV polymerase the Grterimnal two thirfs of the^ acti>oty.i::: 

activity (26-^1). Among these r^kidues arferMet^512; Arg^682. -"there^fs^at/ least a 3-fold differenwun reacUjdty wi^^^^ 

Ly8-768. Tyr-766, Arg-841, and'His-SSl. Cb^^ Taq antibody of the truncated t/erst^ the full-length form of the ; ' 

Pol I amino acid sequence to the.E^.coK Ifpll sequence,,^ tf^itiii L§^kA<\JMi:i^ 

the above residues, except Metr5i2,:ar^consim^ed;T^^ 
contains a Leu residue at the arialogdus position.: Apparentlyr 



the functionally similar Taq Pol I Leii residue at position 417 
can fulfill the role ascribed to K cdli Pol rMet-512 in template 
primer binding (30). ": \ 

Analyses of the effects of various mutations in the E, coli 
Pol I gene upon enzymatic activity have also been used to 
define amino acid residues important for polymerase activity 



?s|Kb?^^^ or Taq 

Xfl^uoetow^ln pLSGi <Fig.,^)-:.the.. 




yiniffsse.inr 

beginning of the Taq Pol I open reading frame is 109 bp distal ; • 
to the Bglll site and 171 bp/distal to the UxcZa translation/ 
initiaiion site. A ioV level oj^Taq.Pol.I expression in cefls "\ 
harboring pLSGl is consisteht with an in-phase TGA codon 
(-ill through -109, Fig. 2) in the Tat? DNA sequence causing 



» , ^, ^ A Iv-x- *V';;*.-«cA Yr^;»M■<:^ . Uranslaticm terminati^^ 

For example, a Gly to Arg-m<.tation-at:^.^9,l.dt, «50 (poW5) » j^^^^,^^^^ ^^e firet ATG'iei.lil^ in the synthesis of the 



results in a poljrmerase that, is less profcessive on the DNA 
substrate (32). An Arg to His mutation at position 690 {polAG) 
results in a polymerase that is defective in DNA binding (33). 
Both Gly-850 and Arg-690 are conserved residues in Taq Pol 
I. Joyc et qL (34) have characterized a number of E. cdli Pol 
I mutants defective in 6'-3' exonuciease activity. Interest- 
ingly, the four mutations. Y77C {polAlO?), G103E (po/- 
A4113), G1S4D {polA480ex), and 01920 (poL42f^) *ll occur 
at amino acid residues that are. conserved Jn Taq Pol I. 

As would be expected for an enzyme from a thermbphitic 
organism, Taq Pol I is corisiderably itioie thermostable than 
Pol I from E. coli (data to be presented in a later publication). 
Although a better assessment of an enzyme's thei-mostabiUty 
would result from a complete cataloging, of ail stabilizing 
amino acid interactions, in the absence of high resolution x- 
ray crystal structures, many ireseafchersi haVe at^tempte to 
explain enzymjinhermdMfeili'^y.by ar^^^ of^mmo acid 

content (35-37), Several features of thermostable enzymes 
have been noted in such studies: Among those features are 
increased ratios of Arg to^ LyS residues, .Glu :to Asp residues,-" 
Ala to Gly residue8,*^hr .to Ser 'reslduj^S, faijd a reduced Cys. 
content. Comparing Taq Pbll ito ;ii&irPdI 1, tlie Ala to Gly 
and Thr to Ser ratios are smaller fpr .Taq.Pol I .than for 
coUPol L Ofthe thermostabilizing type amino acid alterations 
that hold true, it is particularly notable that the Arg to Lyi 
ratio for Taq Pol I is nearly twice that for E. coli Pol t It is 
possible that the propensity of thermophilic proteins to con- 
tain Arg rather than Lys residues is simply a reflection of the 
high GO content of thermophilic organisms. The structural 
gene for Taq Pol I conUins 67.9% GC compared to a 52.0% 
GO content for E, coli Pol 1. The six Arg codons are rich in G 
and C (13 ut of 18 bases are G or C) compared to the two 
Lys codonis (1 out of 6 bases is a G). This explanation for 
amino acid preferences in proteins from thermophilic orga- 
nisms cannot be the basis for Glu versus Asp. Thr versus Ser. 
or Ala versus Gly preference, because there are equal ratios - 
of GC versus AT in the codons for those pairs of amino acids. 



94-kI)a Tti^ Pol I protein; Further manipMation of the Taq '- ' 
DNA polymerase sequence has increased the level of expres- ' " 
sioh.* The cloned full-length Taq Pol l gene in pLSGl affords 
the advantages of expressing TaqTol I in E, coli arid in ease 
of isolating the enzyme from coli compared to T. aquaticus. 
These advantages will aid in further study of the enzyme and 
will provide a ready source of Taq Pol I for use in PGR arid 
other biochemical procedures in which Taq Pol I might prove 
useful; such as in DNA sequencmg/ . . . * : ... . . : 
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